Learning to identify antonyms
نویسنده
چکیده
1. Introduction Antonymy is a common lexical relation that is intuitively clear (if not always easy to define) for humans, but challenging for machines. In Natural Language Processing, antonymy detection has applications in tasks of understanding language, such as paraphrase detection, question answering, and textual inference. For that reason, WordNet (Fellbaum, 2005) includes some antonymy annotation; but the relation is relatively rare, and a quick manual inspection reveals that there are many more antonym pairs (including very common ones) than those shown in WordNet. It also becomes clear that the relation is somewhat vague; masculine and neuter, for example, are listed as antonyms in WordNet, but many native speakers of English would not intuitively consider these words antonyms. The presence of these antonyms is WordNet makes an interesting resource for supervised learning, however, which opens the possibility of trying to automatically extend the annotation. Identifying pairs of antonyms in corpora is the task I propose in this paper. The approach is to train a classifier to distinguish between sets of sentences that contain pairs of antonyms from sets of sentences that do not. The intuition behind it is that antonyms are often used contrastively in the same sentence. The highest-performing classifier obtains 84% accuracy. Because the literature on this task is limited, it is hard to rigorously compare the performance of the classifier with existing published results; however, it does seem to in line with, and perhaps above, results reported from other research. An important aspect of this approach is that knowledge-rich feature engineering was deliberately avoided. The reason for this is that an approach to detecting antonymy is much more productive if it fits into a general framework for learning other lexical relations, such as synonymy, hypernymy, entailment etc.. Therefore, whereas linguistic knowledge about the antonymy relation can be useful for the task (for example, features indicating the presence of morphemes such as un-or dis-would clearly be informative), I instead opted for a distributional approach, where the features are
منابع مشابه
On the use of antonyms and synonyms from a domain perspective
This corpus study addresses the question of the nature and the structure of antonymy and synonymy in language use, following automatic methods to identify their behavioral patterns in texts. We examine the conceptual closeness/distance of synonyms and antonyms through the lens of their DOMAIN instantiations.
متن کاملLearning Antonyms with Paraphrases and a Morphology-Aware Neural Network
Recognizing and distinguishing antonyms from other types of semantic relations is an essential part of language understanding systems. In this paper, we present a novel method for deriving antonym pairs using paraphrase pairs containing negation markers. We further propose a neural network model, AntNET, that integrates morphological features indicative of antonymy into a path-based relation de...
متن کاملA Multitask Objective to Inject Lexical Contrast into Distributional Semantics
Distributional semantic models have trouble distinguishing strongly contrasting words (such as antonyms) from highly compatible ones (such as synonyms), because both kinds tend to occur in similar contexts in corpora. We introduce the multitask Lexical Contrast Model (mLCM), an extension of the effective Skip-gram method that optimizes semantic vectors on the joint tasks of predicting corpus co...
متن کاملSound to meaning correspondences facilitate word learning.
A fundamental assumption regarding spoken language is that the relationship between sound and meaning is essentially arbitrary. The present investigation questioned this arbitrariness assumption by examining the influence of potential non-arbitrary mappings between sound and meaning on word learning in adults. Native English-speaking monolinguals learned meanings for Japanese words in a vocabul...
متن کاملA Uniform Approach to Analogies, Synonyms, Antonyms, and Associations
Recognizing analogies, synonyms, antonyms, and associations appear to be four distinct tasks, requiring distinct NLP algorithms. In the past, the four tasks have been treated independently, using a wide variety of algorithms. These four semantic classes, however, are a tiny sample of the full range of semantic phenomena, and we cannot afford to create ad hoc algorithms for each semantic phenome...
متن کامل